Stereoselective virtual screening of the ZINC database using atom pair 3D-fingerprints
نویسندگان
چکیده
BACKGROUND Tools to explore large compound databases in search for analogs of query molecules provide a strategically important support in drug discovery to help identify available analogs of any given reference or hit compound by ligand based virtual screening (LBVS). We recently showed that large databases can be formatted for very fast searching with various 2D-fingerprints using the city-block distance as similarity measure, in particular a 2D-atom pair fingerprint (APfp) and the related category extended atom pair fingerprint (Xfp) which efficiently encode molecular shape and pharmacophores, but do not perceive stereochemistry. Here we investigated related 3D-atom pair fingerprints to enable rapid stereoselective searches in the ZINC database (23.2 million 3D structures). RESULTS Molecular fingerprints counting atom pairs at increasing through-space distance intervals were designed using either all atoms (16-bit 3DAPfp) or different atom categories (80-bit 3DXfp). These 3D-fingerprints retrieved molecular shape and pharmacophore analogs (defined by OpenEye ROCS scoring functions) of 110,000 compounds from the Cambridge Structural Database with equal or better accuracy than the 2D-fingerprints APfp and Xfp, and showed comparable performance in recovering actives from decoys in the DUD database. LBVS by 3DXfp or 3DAPfp similarity was stereoselective and gave very different analogs when starting from different diastereomers of the same chiral drug. Results were also different from LBVS with the parent 2D-fingerprints Xfp or APfp. 3D- and 2D-fingerprints also gave very different results in LBVS of folded molecules where through-space distances between atom pairs are much shorter than topological distances. CONCLUSIONS 3DAPfp and 3DXfp are suitable for stereoselective searches for shape and pharmacophore analogs of query molecules in large databases. Web-browsers for searching ZINC by 3DAPfp and 3DXfp similarity are accessible at www.gdb.unibe.ch and should provide useful assistance to drug discovery projects. Graphical abstractAtom pair fingerprints based on through-space distances (3DAPfp) provide better shape encoding than atom pair fingerprints based on topological distances (APfp) as measured by the recovery of ROCS shape analogs by fp similarity.
منابع مشابه
EpiDBase: a manually curated database for small molecule modulators of epigenetic landscape
We have developed EpiDBase (www.epidbase.org), an interactive database of small molecule ligands of epigenetic protein families by bringing together experimental, structural and chemoinformatic data in one place. Currently, EpiDBase encompasses 5784 unique ligands (11 422 entries) of various epigenetic markers such as writers, erasers and readers. The EpiDBase includes experimental IC(50) value...
متن کاملLearning Deep Architectures for Interaction Prediction in Structure-based Virtual Screening
We introduce a deep learning architecture for structure-based virtual screening that generates fixed-sized fingerprints of proteins and small molecules by applying learnable atom convolution and softmax operations to each compound separately. These fingerprints are further transformed non-linearly, their inner-product is calculated and used to predict the binding potential. Moreover, we show th...
متن کاملPharmacophore Based Virtual Screening Approach to Identify Selective PDE4B Inhibitors
Phosphodiesterase 4 (PDE4) has been established as a promising target in asthma andchronic obstructive pulmonary disease. PDE4B subtype selective inhibitors are known toreduce the dose limiting adverse effect associated with non-selective PDE4B inhibitors. Thismakes the development of PDE4B subtype selective inhibitors a desirable research goal. Toachieve this goal, ligand based pharmacophore m...
متن کاملPharmacophore Based Virtual Screening Approach to Identify Selective PDE4B Inhibitors
Phosphodiesterase 4 (PDE4) has been established as a promising target in asthma andchronic obstructive pulmonary disease. PDE4B subtype selective inhibitors are known toreduce the dose limiting adverse effect associated with non-selective PDE4B inhibitors. Thismakes the development of PDE4B subtype selective inhibitors a desirable research goal. Toachieve this goal, ligand based pharmacophore m...
متن کاملChemoPy: freely available python package for computational biology and chemoinformatics
MOTIVATION Molecular representation for small molecules has been routinely used in QSAR/SAR, virtual screening, database search, ranking, drug ADME/T prediction and other drug discovery processes. To facilitate extensive studies of drug molecules, we developed a freely available, open-source python package called chemoinformatics in python (ChemoPy) for calculating the commonly used structural ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 7 شماره
صفحات -
تاریخ انتشار 2015